The DIR Net: A Distributed System for Detection, Isolation, and Recovery
نویسنده
چکیده
This document describes the DIR net, a distributed environment which is part of the EFTOS fault tolerance framework. The DIR net is a system consisting of two components, called DIR Manager (or, shortly, the manager) and DIR Backup Agent (shortly, the backup). One manager and a set of backups is located in the system to be `guarded', one component per node. At this point the DIR net weaves a web which substantially does two things: 1) makes itself tolerant to a number of possible faults, and 2) gathers information pertaining the run of the user application. As soon as an error occurs within the DIR net, the system executes built-in recovery actions that allow itself to continue processing despite a number of hardware/software faults, possibly doing a graceful degradation of its features; when an error occurs in the user application, the DIR net, by means of custom- and user-defined detection tools, is informed of such events and runs one or more recovery strategies, both built-in and coded by the user using an ancillary compile-time tool, the rl translator. Such tools translates the user-defined strategies into a binary `R-code', i.e., a pseudo-code interpreted by a special component of the DIR net, the Recovery Interpreter, rint (in a sense, rint is a r-code virtual machine.) This document describes the generic component of the DIR net, a function which can behave either as manager or as backup.
منابع مشابه
A framework backbone for software fault tolerance in embedded parallel applications
The DIR net (detection-isolation-recovery net) is the main module of a software framework fi)r the development of embedded supercornputing applications. This framework provides a set of functional elements, collected in a library, to improve the dependability attributes of the applications (especially the availability) . The DIR net enables these functional elements to cooperate and enhances th...
متن کاملENERGY AWARE DISTRIBUTED PARTITIONING DETECTION AND CONNECTIVITY RESTORATION ALGORITHM IN WIRELESS SENSOR NETWORKS
Mobile sensor networks rely heavily on inter-sensor connectivity for collection of data. Nodes in these networks monitor different regions of an area of interest and collectively present a global overview of some monitored activities or phenomena. A failure of a sensor leads to loss of connectivity and may cause partitioning of the network into disjoint segments. A number of approaches have be...
متن کاملOptimum Local Decision Rules in a Distributed Detection System with Dependent Observations
The theory of distributed detection is receiving a lot of attention. A common assumption used in previous studies is the conditional independence of the observations. In this paper, the optimization of local decision rules for distributed detection networks with correlated observations is considered. We focus on presenting the detection theory for parallel distributed detection networks with fi...
متن کاملOptimum Local Decision Rules in a Distributed Detection System with Dependent Observations
The theory of distributed detection is receiving a lot of attention. A common assumption used in previous studies is the conditional independence of the observations. In this paper, the optimization of local decision rules for distributed detection networks with correlated observations is considered. We focus on presenting the detection theory for parallel distributed detection networks with fi...
متن کاملWavelength region selection and spectrophotometric simultaneous determination of naphthol isomers based on net analyte signal
Naphthol isomers were simultaneously and spectrophotometrically determined in wastewater, using a model based on net analyte signal (NAS). The calibration method used is a variation of the original hybrid linear analysis method as proposed by Goicoechea and Olivieri (HLA/GO). Owing to spectral interferences, the simultaneous determination of mixtures of naphthol isomers, using a spectrophotomet...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1504.07281 شماره
صفحات -
تاریخ انتشار 2015